Speech retrieval device, speech retrieval method, and display device

ABSTRACT

A voice search apparatus includes a web page display controller, a voice recognizer, a matching unit, and a URL creator. The web page display controller controls a web page which is to be displayed on a display. The voice recognizer recognizes a voice that is input, converts the voice into character string information, and outputs the character string information. The matching unit refers to a URL list in which a character string representing part of a URL is registered for each search category, performs matching processing between a URL of the web page displayed on the display and the character string in the URL list, and specifies the character string in the URL list corresponding to the URL. The URL creator creates a URL based on the character string specified by the matching unit and the character string information.

TECHNICAL FIELD

The present disclosure relates to voice search apparatuses, voice searchmethods, and display apparatuses that operate by recognizing a voiceuttered by a user.

BACKGROUND ART

Patent Literature 1 discloses a voice input apparatus that has a voicerecognition function. This voice input apparatus is configured toreceive a voice uttered by a user, to recognize an instruction indicatedby the voice of the user (hereinafter referred to as “voicerecognition”) by analyzing the received voice, and to control a devicein response to the voice-recognized instruction. That is, the voiceinput apparatus of Patent Literature 1 is capable of performing voicerecognition on the voice arbitrarily uttered by the user, andcontrolling the device in response to the instruction that is a resultof the voice recognition (hereinafter referred to as “command”).Hereinafter, an operation of the device using the voice recognitionfunction is referred to as “voice operation”.

The user who uses this voice input apparatus may perform a search on aweb site that provides a search service on the Internet (hereinafterreferred to as a “search site”), for example, while operating a devicesuch as a television receiver (hereinafter referred to as “television”)and PC (Personal Computer) by using this voice recognition function ofthe voice input apparatus. Hereinafter, the search by the voiceoperation is referred to as “voice search”. Data to be searched at thistime is classified, for example, into a plurality of categories such asweb pages, images, and maps. The user may also use this voicerecognition function of the voice input apparatus to search data thatbelongs to a category different from the above categories, such asprogram guide data and recorded videos retained in the television.

Patent Literature 2 discloses an information search method forperforming information search using a table (correspondence table) thatassociates keywords with categories. In case that a voice uttered by theuser can be classified into either of the categories registered in thetable, this information search method allows voice recognition of thevoice uttered by the user, and allows a search of the category by usingthe keyword included in a character string resulting from the voicerecognition.

For example, when performing a voice search of the category of “images”for an image of “ABC”, the user who uses the information search methoddescribed in Patent Literature 2 may utter “Search images for ABC”. Thiscauses a search site to search the category of “images” for “ABC”. Whensearching for an image of “XYZ” continuously, the user may utter, forexample, “Search images for XYZ”.

CITATION LIST Patent Literature

PTL 1: Japanese patent No. 3292190

PTL 2: Unexamined Japanese Patent Publication No. 2001-75968

SUMMARY

The present disclosure provides a voice search apparatus and voicesearch method for improving user operativity.

The voice search apparatus according to the present disclosure includesa web page display controller, a voice recognizer, a matching unit, anda URL creator. The web page display controller controls a web page whichis to be displayed on a display. The voice recognizer recognizes a voicethat is input, converts the voice into character string information, andoutputs the character string information. The matching unit refers to aURL list in which a character string representing part of a URL isregistered for each search category, performs matching processingbetween a URL of the web page displayed on the display and the characterstring in the URL list, and specifies the character string in the URLlist corresponding to the URL. The URL creator creates a URL based onthe character string specified by the matching unit and the characterstring information.

The voice search method according to the present disclosure includes:displaying a web page on a display; recognizing a voice that is inputand converting the voice into character string information; referring toa URL list in which a character string representing part of a URL isregistered for each search category, performing matching processingbetween a URL of the web page displayed on the display and the characterstring in the URL list, and specifying the character string in the URLlist corresponding to the URL; and creating a URL based on the specifiedcharacter string and the character string information.

A display apparatus according to the present disclosure includes adisplay, a web page display controller, a voice recognizer, a matchingunit, and a URL creator. The web page display controller controls a webpage which is to be displayed on a display. The voice recognizerrecognizes a voice that is input, converts the voice into characterstring information, and outputs the character string information. Thematching unit refers to a URL list in which a character stringrepresenting part of a URL is registered for each search category,performs matching processing between a URL of the web page displayed onthe display and the character string in the URL list, and specifies thecharacter string in the URL list corresponding to the URL. The URLcreator creates a URL based on the character string specified by thematching unit and the character string information.

The voice search apparatus according to the present disclosure mayimprove operativity when the user performs a voice operation.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 is a diagram schematically illustrating a voice recognitionsystem according to a first exemplary embodiment.

FIG. 2 is a block diagram illustrating a configuration example of thevoice recognition system according to the first exemplary embodiment.

FIG. 3 is a diagram schematically illustrating an example of a reservedword table stored in a storage of a voice recognition apparatusaccording to the first exemplary embodiment.

FIG. 4 is a diagram schematically illustrating an example of a searchURL list stored in the storage of the voice recognition apparatusaccording to the first exemplary embodiment.

FIG. 5 is a diagram illustrating an outline of dictation to be performedby the voice recognition system according to the first exemplaryembodiment.

FIG. 6 is a diagram illustrating an example of a web browser displayedon a display of a television receiver according to the first exemplaryembodiment.

FIG. 7 is a flow chart illustrating an operation example of voice searchprocessing to be performed by the voice recognition system according tothe first exemplary embodiment.

FIG. 8 is a flow chart illustrating an operation example of voicerecognition intention interpretation processing to be performed by thevoice recognition system according to the first exemplary embodiment.

FIG. 9 is a flow chart illustrating an operation example of searchcategory determination processing to be performed by the voicerecognition system according to the first exemplary embodiment.

DESCRIPTION OF EMBODIMENTS

Exemplary embodiments will be described in detail below with referenceto the drawings as necessary. However, a description that is moredetailed than necessary may be omitted. For example, a detaileddescription of an already well-known item and a repeated description ofsubstantially identical components may be omitted. This is intended toavoid the following description from becoming unnecessarily redundantand to make the description easier for a person skilled in the art tounderstand.

Note that the accompanying drawings and the following description areprovided in order for a person skilled in the art to fully understandthe present disclosure, and are not intended to limit the subjectdescribed in the appended claims.

First Exemplary Embodiment

A first exemplary embodiment will be described below with reference toFIG. 1 to FIG. 9. Note that although television receiver (television) 10is cited in the present exemplary embodiment as an example of a displayapparatus including a voice search apparatus, the display apparatus isnot limited to television 10. For example, the display apparatus may bean apparatus such as a PC and a tablet terminal.

[1-1. Configuration]

FIG. 1 is a diagram schematically illustrating voice recognition system11 according to the first exemplary embodiment. In the present exemplaryembodiment, the voice search apparatus is incorporated in television 10,which is an example of the display apparatus. The voice recognitionapparatus included in the voice search apparatus incorporated intelevision 10 recognizes a voice uttered by a user, and controlstelevision 10 in accordance with a result of the voice recognition.

As illustrated in FIG. 1, voice recognition system 11 according to thepresent exemplary embodiment includes television 10 and voice recognizer50. In addition, voice recognition system 11 may also include at leastone of remote controller (hereinafter also referred to as “remocon”) 20and mobile terminal 30.

When the voice recognition apparatus starts in television 10, voicerecognition icon 201 and indicator 202 indicating volume of a collectedvoice are displayed on display 140 of television 10, together with avideo based on a signal such as an input video signal and a receivedbroadcast signal. This is for indicating user 700 that an operation oftelevision 10 based on a voice of user 700 (voice operation) isavailable and for prompting user 700 to utter a voice.

When user 700 utters a voice toward remote controller 20 or mobileterminal 30, the voice will be collected by a microphone incorporated inremote controller 20 or in mobile terminal 30, and will be transferredto television 10. Then, the voice uttered by user 700 undergoes voicerecognition by the voice recognition apparatus incorporated intelevision 10. In television 10, control of television 10 is performedin accordance with the result of the voice recognition.

Television 10 may include microphone 130. In this case, when user 700utters a voice toward microphone 130 included in television 10, thevoice will be collected by microphone 130 and undergo voice recognitionby the voice recognition apparatus. Therefore, it is also possible toconfigure voice recognition system 11 to include neither remotecontroller 20 nor mobile terminal 30.

Also, television 10 is connected to voice recognizer 50 via network 40.Television 10 may communicate with voice recognizer 50 via network 40.

FIG. 2 is a block diagram illustrating a configuration example of voicerecognition system 11 according to the first exemplary embodiment. Notethat remote controller 20 and mobile terminal 30 are omitted in FIG. 2.

Television 10 includes voice search apparatus 200, tuner 190, displaycontroller 108, microphone 130, display 140, storage 181, operationacceptor 110, and transmitter-receiver 150. Television 10 may include awireless communicator (not illustrated) that performs wirelesscommunications with remote controller 20 or mobile terminal 30.

Voice search apparatus 200 includes voice recognition apparatus 100 andweb browser controller 170.

Voice recognition apparatus 100 is configured to acquire the voiceuttered by user 700 and to analyze the acquired voice. Voice recognitionapparatus 100 is configured to then recognize a “free word” and“reserved word” that the voice indicates, and to control television 10in accordance with the result of the recognition. The specificconfiguration of voice recognition apparatus 100 will be describedlater. Note that the “reserved word” is a previously determined term,and is a term registered in a reserved word table to be described later.

Microphone 130 is a microphone configured to collect a voice that mainlycomes from a direction facing a display surface of display 140. That is,a sound-collecting direction of microphone 130 is set so as to collectthe voice uttered by user 700 who faces display 140 of television 10.Microphone 130 may collect the voice uttered by user 700 accordingly.Microphone 130 may be provided inside an enclosure of television 10, andas illustrated in an example of FIG. 1, may be installed outside theenclosure of television 10.

Remote controller 20 illustrated in FIG. 1 is a controller for user 700to perform remote control of television 10. In addition to a generalconfiguration required for remote control of television 10, remotecontroller 20 includes a microphone and an input unit. The microphone isconfigured to collect the voice uttered by user 700 and to output avoice signal. The input unit is configured to accept an input operationperformed by user 700 manually, and to output an input signal inresponse to the input operation. This input unit, which is, for example,a touchpad, may also be a keyboard, a button, or the like. The voicesignal generated from the voice collected by the microphone, or theinput signal generated by user 700 performing the input operation on theinput unit is wirelessly transmitted to television 10 by, for example,infrared rays, electromagnetic waves, or any other wireless medium.

Display controller 108 performs processing such as decoding andmultiplexing of a plurality of pieces of image data, such as image datagenerated by web browser controller 170 and image data based on abroadcast signal that tuner 190 receives via an antenna (notillustrated). Display controller 108 then generates an image signal ofan image to be displayed on display 140. Then, display controller 108controls display 140, and displays the image based on those imagesignals on display 140.

Display 140, which is, for example, a liquid crystal display, may alsobe a display such as a plasma display and an organic EL (ElectroLuminescence) display. Display 140 is controlled by display controller108, and displays an image based on image signals such as an externalinput video signal and an image signal generated by display controller108 based on a signal such as the broadcast signal received by tuner190.

Transmitter-receiver 150 is connected to network 40, and is configuredto communicate via network 40 with an external device (for example,voice recognizer 50, URL server 60, search site 70, and any otherdevice) connected to network 40.

Tuner 190 is configured to receive a television broadcast signal ofterrestrial broadcasting or satellite broadcasting via an antenna (notillustrated). Tuner 190 may be configured to receive the televisionbroadcast signal transmitted via a cable.

Storage 181, which is, for example, a nonvolatile semiconductor memory,may be a device such as a volatile semiconductor memory and a hard disk.Storage 181 stores information (data), a program, and the like to beused for control of each unit of television 10.

Mobile terminal 30 illustrated in FIG. 1 is, for example, a smart phone,on which software for remotely controlling television 10 may run.Therefore, in voice recognition system 11 according to the presentexemplary embodiment, mobile terminal 30 on which the software isrunning may be used for remote control of television 10. Mobile terminal30 includes a microphone and an input unit. In a similar manner to themicrophone included in remote controller 20, the microphone incorporatedin mobile terminal 30 is configured to collect the voice uttered by user700 and to output a voice signal. The input unit is configured to acceptan input operation performed by user 700 manually, and to output aninput signal in response to the input operation. The input unit, whichis, for example, a touch panel, may also be a keyboard, a button, etc.In a similar manner to remote controller 20, mobile terminal 30 on whichthe software is running wirelessly transmits, to television 10, thevoice signal generated from the voice collected by the microphone, orthe input signal generated by user 700 performing the input operation onthe input unit by, for example, infrared rays, electromagnetic waves, orany other wireless medium.

Television 10, and remote controller 20 or mobile terminal 30 areconnected by wireless communications, such as, for example, wireless LAN(Local Area Network) and Bluetooth (registered trademark).

Network 40, which is, for example, the Internet, may be another network.

Voice recognizer 50 is a server connected to television 10 via network40 (for example, a server on a cloud). Voice recognizer 50 receivesvoice information transmitted from television 10, and converts thereceived voice information into a character string. Note that thischaracter string may be a plurality of characters, and may be onecharacter. Then, voice recognizer 50 transmits character stringinformation (text information) that indicates the converted characterstring to television 10 via network 40 as a result of the voicerecognition.

Search site 70 is a web-site server, connected to network 40, forproviding a search service on network 40. One server may be used forsearch site 70, and two or more servers may be used. The server forsearch site 70 is configured to provide the following services. Whenuser 700 performs a user operation for a purpose of search with aterminal connected to network 40 (for example, television 10), acharacter string that indicates a query string based on the useroperation and a path is set in a URL bar (Uniform Resource Locator bar)of a web browser. This URL bar includes information on the characterstring based on the user operation and information that specifies aspecific server (for example, a server for performing a search). Theserver for search site 70 specified by the URL bar receives thecharacter string information in the URL bar, and performs a search basedon the character string information. The server then returns informationthat indicates a result of the search (information that indicates a webpage resulting from the search) to the terminal.

Web browser controller 170 of television 10 is configured to performinformation exchange described above with each server for search site70.

URL server 60 is a server connected to television 10 via network 40. URLserver 60 stores a search URL list. The search URL list corresponds to atable (list) that associates a template for the character string to beset in the URL bar of the web browser (hereinafter referred to as a“search URL template”), with a category to be searched (hereinafterreferred to as a “search category”). The search category refers to asearch range for performing a search with a free word. In this searchURL list, a character string that indicates information such as acommunication method (for example, “http” (Hyper Text TransferProtocol)), each server name for search site 70 (for example,“search.co.jp”, or the like), and a search category (for example,“images”, or the like) is registered as a search URL template. Detailsof the search URL list will be described later. The search URL list inURL server 60 is updatably configured, and URL server 60 may update thesearch URL list regularly or irregularly by collecting information fromnetwork 40 or any other method.

Television 10 is configured to be capable of acquiring the search URLlist from URL server 60 via network 40. This allows television 10 toupdate search URL list D500 stored in storage 180 of television 10 asnecessary.

Voice recognition apparatus 100 of television 10 includes voiceacquisitor 101, voice processor 102, recognition result acquisitor 103,intention interpretation processor 104, reserved word processor 106, andstorage 180.

Voice acquisitor 101 is configured to acquire the voice signal generatedfrom the voice uttered by the user. Voice acquisitor 101 may acquire thevoice signal generated from the voice uttered by the user, frommicrophone 130 incorporated in television 10, from a microphoneincorporated in remote controller 20, or a microphone incorporated inmobile terminal 30 via a wireless communicator (not illustrated). Then,voice acquisitor 101 converts the voice signal into voice informationthat may be used for various types of downstream processing, and outputsthe voice information to voice processor 102. Note that when the voicesignal is a digital signal, voice acquisitor 101 may use the voicesignal as it is as the voice information.

Voice processor 102 is configured to perform “reserved word recognitionprocessing” and to convert the voice information into reserved wordinformation. The “reserved word recognition processing” refers toprocessing for determining whether a preset reserved word is included inthe voice information acquired from voice acquisitor 101, and ifincluded, for specifying the reserved word. Specifically, voiceprocessor 102 refers to reserved word table D400 previously stored instorage 180, based on the voice information acquired from voiceacquisitor 101. Reserved word table D400 is an association table inwhich the voice information is associated with reserved words. Theplurality of reserved words is registered in reserved word table D400,and each of the reserved words is associated with a piece of voiceinformation different from one another. Therefore, voice processor 102may determine whether the reserved word is included in the voiceinformation by collating the voice information with reserved word tableD400.

Here, reserved word table D400 will be described with reference to FIG.3. FIG. 3 is a diagram schematically illustrating an example of reservedword table D400 stored in storage 180 of voice recognition apparatus 100according to the first exemplary embodiment.

Reserved word table D400 illustrated as an example in FIG. 3 is storedin storage 180 of voice recognition apparatus 100. The plurality ofreserved words and information that indicates attributes of the reservedwords are associated with each other and registered in reserved wordtable D400.

In the example illustrated in FIG. 3, in reserved word table D400, wordssuch as “images”, “videos”, “program”, “Web”, “news”, “maps”,“restaurant”, “gas station”, “proceed”, “return”, and “search” areregistered as the reserved words, and words such as “search” and“command” are registered as the attributes for the reserved words. Inreserved word table D400, words to be used for directions to television10 (hereinafter referred to as “direction information”) are classifiedinto “command”, and words to be used for specification of the searchcategory are classified into “search”. In reserved word table D400, theattribute of each of “images”, “videos”, “program”, “Web”, “news”,“maps”, “restaurant”, and “gas station” is “search”, and the attributeof each of “proceed”, “return”, and “search” is “command”. Note that theword that specifies the search category refers to a word that specifiesthe search range for performing a search with a free word.

Voice processor 102 refers to reserved word table D400. When thereserved word included in the voice information acquired from voiceacquisitor 101 is specified, voice processor 102 outputs “reserved wordinformation” to recognition result acquisitor 103 as a result of thevoice recognition. The “reserved word information” refers to informationthat represents the reserved word included in the voice information andthe attribute of the reserved word.

For example, when the reserved word of “images” is included in the voiceinformation, voice processor 102 collates the voice information withreserved word table D400, and outputs the reserved word information thatindicates that the reserved word included in the voice information is“images”, and that the attribute of the reserved word is “search”, torecognition result acquisitor 103.

Note that reserved word table D400 illustrated in FIG. 3 is a mereexample, and a word other than the reserved words illustrated in FIG. 3may be registered in reserved word table D400 as the reserved word. Inaddition, a reserved word with an attribute other than “search” and“command” may be registered in reserved word table D400.

In addition, voice processor 102 transmits the voice informationacquired from voice acquisitor 101, from transmitter-receiver 150 vianetwork 40 to voice recognizer 50.

Voice recognizer 50 is configured to perform “free word recognitionprocessing” and to convert the voice information into the characterstring information (text information). On receipt of the voiceinformation transmitted from television 10, voice recognizer 50separates the voice information into clauses in order to distinguish afree word from words other than the free word, and converts each clauseinto a character string (hereinafter referred to as “dictation”).Examples of the words other than the free word include a particle, apreposition, and any other word. Then, voice recognizer 50 transmitsinformation on the character string after dictation (character stringinformation) to television 10 as a result of the voice recognition. Thatis, voice recognizer 50 recognizes the voice that is input, converts thevoice into the character string information, and outputs the characterstring information. Note that the reserved word may be included in thefree word in voice recognizer 50.

Note that voice recognizer 50 is configured to receive all the voiceinformation transmitted from television 10, to dictate all the receivedvoice information, and to transmit all the character string informationobtained as a result of the dictation to television 10. However, thepresent exemplary embodiment is not limited to this configuration. Forexample, voice recognizer 50 may acquire voice information other thanthe reserved word from the received voice information, or may convertthe voice information other than the reserved word from the receivedvoice information into a character string, and may return the characterstring. Alternatively, television 10 may transmit the voice informationother than the reserved word to voice recognizer 50.

Recognition result acquisitor 103 acquires the reserved word informationas the result of the voice recognition from voice processor 102. Inaddition, recognition result acquisitor 103 acquires the characterstring information as the result of the voice recognition from voicerecognizer 50 via network 40 and transmitter-receiver 150. Then,recognition result acquisitor 103 outputs the acquired reserved wordinformation and character string information to intention interpretationprocessor 104.

Intention interpretation processor 104 is an example of “a sorter”.Intention interpretation processor 104 is configured to sort thereserved word information and free word information from the characterstring information. On acquisition of the reserved word information andcharacter string information from recognition result acquisitor 103,intention interpretation processor 104 sorts the “free word” and the“reserved word” from the character string information, based on thereserved word information. Note that the free word in intentioninterpretation processor 104 corresponds to a character string obtainedby removing words such as the reserved word, particle, and prepositionfrom the character string information. Therefore, the free word inintention interpretation processor 104 does not include the reservedword. In addition, intention interpretation processor 104 extractsinformation that indicates the attribute of the reserved word from thereserved word information. Then, based on the sorted free word, reservedword, and the information that indicates the attribute of the reservedword, intention interpretation processor 104 performs intentioninterpretation for specifying intention of the voice operation utteredby user 700. Details of the operation of intention interpretationprocessor 104 will be described later. Each piece of information whichis output from intention interpretation processor 104 and whichindicates the free word, reserved word, and attribute of the reservedword (free word information and reserved word information) is input intoreserved word processor 106.

Reserved word processor 106 is configured to confirm, on acquisition ofthe information from intention interpretation processor 104, whether theinformation includes predetermined information. This predeterminedinformation corresponds to information that represents a “reserved wordwith the attribute of “search””. Web browser controller 170 according tothe present exemplary embodiment is configured to switch subsequentprocessing depending on whether the reserved word with the attribute of“search” is included in the information that is output from intentioninterpretation processor 104. Specifically, when reserved word processor106 determines that the reserved word with the attribute of “search” isincluded in the information that is output from intention interpretationprocessor 104, search URL creator 171 of web browser controller 170executes processing. On the other hand, when reserved word processor 106determines that the reserved word with the attribute of “search” is notincluded in the information that is output from intention interpretationprocessor 104, search URL matching processor 172 of web browsercontroller 170 executes processing.

Note that reserved word processor 106 determines that the reserved wordwith the attribute of “search” is not included in the information thatis output from intention interpretation processor 104, when reservedword processor 106 determines that the reserved word is not included andonly the free word is included in the information that is output fromintention interpretation processor 104, or when reserved word processor106 determines that only the reserved word with the attribute other than“search” is included. Details of these operations will be describedlater.

Storage 180, which is, for example, a nonvolatile semiconductor memory,may be a device such as a volatile semiconductor memory and a hard disk.Storage 180 is configured to allow writing and reading of dataarbitrarily. Storage 180 stores reserved word table D400 to be referredto by voice processor 102, search URL list D500 to be referred to by webbrowser controller 170, and any other data. Note that storage 180 andstorage 181 may be integrated into one unit.

Web browser controller 170 of television 10 includes search URL creator171, search URL matching processor 172, URL bar display controller 173,and web page display controller 174.

Search URL creator 171 is an example of a “URL creator”. Search URLcreator 171 is configured, when the free word and the reserved word withthe attribute of “search” are included in the information that is outputfrom intention interpretation processor 104, to refer to search URL listD500 stored in storage 180 based on the reserved word.

A character string that represents part of the URL is registered insearch URL list D500 for each reserved word. In search URL list D500,the reserved word indicates the range to be searched (that is, thesearch category).

Here, search URL list D500 will be described with reference to FIG. 4.FIG. 4 is a diagram schematically illustrating an example of search URLlist D500 stored in storage 180 of voice recognition apparatus 100according to the first exemplary embodiment.

Search URL list D500, with an example illustrated in FIG. 4, is storedin storage 180 of voice recognition apparatus 100. The plurality ofreserved words with the attribute of “search” and the search URLtemplates to be used for search of the range to be searched indicated bythe reserved words are associated with each other and registered insearch URL list D500. That is, search URL list D500 is a “URL list” inwhich the character strings that represent part of the URLs areregistered for respective reserved words (search categories), and is alist of the search URL templates in which the “reserved words (searchcategories)” are registered as index.

In the example illustrated in FIG. 4, for example, when the reservedword is “images”, the search URL template corresponding to the reservedword is as follows.

http://search.co.jp/images?q=

In this search URL template, “http” denotes a communication method,“search.co.jp” denotes a server name for search site 70 to which asearch request is made, and “images” denotes that the category to besearched is “images”. Also, a free word is added after “?q=”. Thus, thesearch URL templates different from one another are registered forrespective reserved words in search URL list D500.

Then, search URL creator 171 refers to search URL list D500, and readsthe search URL template corresponding to the reserved word from searchURL list D500.

For example, it is assumed that character strings of the reserved wordof “images” and the free word of “ABC” are included in output fromintention interpretation processor 104. In this case, search URL creator171 reads the search URL template corresponding to the reserved word of“images” from search URL list D500. Then, search URL creator 171 addsthe character string of “ABC”, which is a free word, to a final part ofthe read search URL template. As a result, if based on the exampleillustrated in FIG. 4, the following character string is created.

http://search.co.jp/images?q=ABC

This character string will be a new URL (that is, search URL) to be usedfor search processing.

Search URL creator 171 thus generates the URL to be used for searchprocessing (search URL), based on the reserved word with the attributeof “search” (FIG. 3), search URL list D500 (FIG. 4), and the characterstring of the free word.

Search URL matching processor 172 is an example of a “matching unit”.Search URL matching processor 172 is configured to perform the followingprocessing when the reserved word with the attribute of “search” is notincluded in the information that is output from reserved word processor106.

Search URL matching processor 172 acquires information on the URL fromURL bar display controller 173. This URL is a URL managed by URL bardisplay controller 173 as a URL for displaying a web page displayed ondisplay 140 of television 10.

The following indicates an example of the URL managed by URL bar displaycontroller 173. For example, when a web page for a map search with thefree word of “ABC” is displayed on display 140 of television 10, thefollowing URL is managed by URL bar display controller 173.

http://search.co.jp/maps?q=ABC

Alternatively, when a web page for a restaurant search with the freeword of “ABC” is displayed on display 140, the following URL is managedby URL bar display controller 173.

http://web-navi.co.jp/search?category=restaurant&query=ABC

Then, search URL matching processor 172 performs collation (that is,matching processing) between the URL acquired from URL bar displaycontroller 173 and the search URL templates in search URL list D500stored in storage 180.

Search URL matching processor 172 reads the search URL template fromsearch URL list D500 sequentially, and examines whether the characterstring of the read search URL template is included in the URL acquiredfrom URL bar display controller 173. Specifically, search URL matchingprocessor 172 examines whether the character string of the URL acquiredfrom URL bar display controller 173 includes a character string thatagrees with the character string of the read search URL template. Whenthe search URL template that is determined to include the agreeingcharacter string is included in search URL list D500, search URLmatching processor 172 determines that the matching processing hassucceeded. Then, search URL matching processor 172 reads the search URLtemplate, and adds the character string of the free word that is outputfrom reserved word processor 106 to the final part of the characterstring of the read search URL template. Search URL matching processor172 thus creates the new URL to be used for search processing (that is,search URL).

Note that voice recognition apparatus 100 may acquire the search URLlist from URL server 60 regularly or irregularly, and may use theacquired search URL list to update search URL list D500 stored instorage 180. Alternatively, voice recognition apparatus 100 may acquirethe search URL list from URL server 60 at timing when search URL creator171 or search URL matching processor 172 uses search URL list D500. Notethat details of processing for determining the above search categorywill be described later.

URL bar display controller 173 is configured to manage the URL of theweb page that web page display controller 174 displays on display 140via display controller 108. Also, URL bar display controller 173 isconfigured to control the character string to be displayed on the URLbar.

Web page display controller 174 includes components such as a parseprocessor, a DOM (Document Object Model) creator, a renderer, and aJavaScript (registered trademark) engine (no component is illustrated).Also, web page display controller 174 displays, on display 140 viadisplay controller 108, files acquired from a website or a localstorage, such as HTML (Hyper Text Markup Language), CSS (Cascading StyleSheets), PNG (Portable Network Graphics), and JavaScript (registeredtrademark).

Operation acceptor 110 receives an input signal generated by an inputoperation performed by user 700 with an input unit of remote controller20, or an input signal generated by an input operation performed by user700 with an input unit of mobile terminal 30, from remote controller 20or mobile terminal 30 via a wireless communicator (not illustrated). Inthis way, operation acceptor 110 accepts the operation performed by user700 (that is, user operation).

[1-2. Operation]

Next, an operation of voice search apparatus 200 of television 10according to the present exemplary embodiment will be described.

First, methods for starting voice recognition processing by voicerecognition apparatus 100 of television 10 will be described. Themethods for starting voice recognition processing by voice recognitionapparatus 100 mainly include the following two methods.

The first method for starting is as follows. In order to start voicerecognition processing, user 700 presses a microphone button (notillustrated) that is one of the input unit provided in remote controller20. When user 700 presses the microphone button of remote controller 20,in television 10, operation acceptor 110 accepts that the microphonebutton of remote controller 20 is pressed. Then, television 10 altersvolume of a speaker (not illustrated) of television 10 into presetvolume. This volume is sufficiently low volume to avoid disturbance ofvoice recognition by the microphone of remote controller 20. Then, whenthe volume of the speaker of television 10 becomes the preset volume,voice recognition apparatus 100 starts voice recognition processing. Atthis time, when the volume of the speaker is equal to or lower than thepreset volume, television 10 does not need to perform the above volumeadjustment, and leaves the volume as it is.

Note that this method may also use mobile terminal 30 (for example, asmart phone including a touch panel) instead of remote controller 20. Inthis case, user 700 starts software (for example, software forperforming voice operation of television 10) included in mobile terminal30, and presses the microphone button displayed on the touch panel bythe software running. This user operation corresponds to a useroperation of pressing the microphone button of remote controller 20.This causes voice recognition apparatus 100 to start voice recognitionprocessing.

The second method for starting is as follows. User 700 utters a voicerepresenting a preset command to start voice recognition processing(start command), to microphone 130 of television 10. Note that thisvoice is, for example, “Hi, television”, “start voice operation”, or anyother voice. However, the start command for voice recognition processingis not limited to these voices, and may be another voice. When voicerecognition apparatus 100 recognizes that the voice collected bymicrophone 130 is the preset start command, television 10 alters thevolume of the speaker of television 10 into the preset volume in asimilar manner to the above method, and voice recognition apparatus 100starts voice recognition processing.

Note that the above methods may be combined to define a method forstarting voice recognition processing by voice recognition apparatus100.

It is assumed that these types of control in television 10 are performedby a controller (not illustrated) that controls each block of television10.

When voice recognition apparatus 100 starts voice recognitionprocessing, in order to prompt user 700 to utter a voice, as illustratedin FIG. 1 for example, display controller 108 displays, on an imagedisplay screen of display 140, voice recognition icon 201 indicatingthat voice recognition processing has started and that voice operationby user 700 has become available, and indicator 202 indicating volume ofthe voice that is being collected.

Note that display controller 108 may display, on display 140, a messageindicating that voice recognition processing has started, instead ofvoice recognition icon 201. Alternatively, display controller 108 mayoutput a message indicating that voice recognition processing hasstarted, with a voice from the speaker.

Note that voice recognition icon 201 and indicator 202 are not limitedto designs illustrated in FIG. 1. Any design may be used as long as anintended effect is obtained.

Next, voice recognition processing to be performed by voice recognitionapparatus 100 of television 10 will be described.

In the present exemplary embodiment, voice recognition apparatus 100performs a first type and a second type of voice recognition processing.The first type is voice recognition processing for recognizing a voicecorresponding to a preset reserved word (hereinafter referred to as“reserved word recognition processing”). The second type is voicerecognition processing for recognizing a word other than the presetreserved word as the free word (hereinafter referred to as “free wordrecognition processing”).

As described above, the reserved word recognition processing isperformed by voice processor 102 included in voice recognition apparatus100. Voice processor 102 compares the voice information based on thevoice uttered by user 700 to television 10 with reserved word table D400previously stored in storage 180. Then, when the voice informationincludes the reserved word registered in reserved word table D400, voiceprocessor 102 specifies the reserved word.

The free word recognition processing is performed using voice recognizer50 connected to television 10 via network 40, as described above. Voicerecognizer 50 acquires the voice information from television 10 vianetwork 40. Then, voice recognizer 50 separates the acquired voiceinformation into clauses, and isolates a free word from words other thanthe free word (for example, particles, prepositions, and any otherwords). In this way, voice recognizer 50 performs dictation. Whenperforming dictation, voice recognizer 50 uses a database thatassociates the voice information with the character string. Voicerecognizer 50 compares the acquired voice information with the databaseto isolate the free word from words other than the free word, andconverts each word into a character string.

Thus, the reserved word recognition processing and the free wordrecognition processing are performed as voice recognition processing invoice recognition apparatus 100 of television 10, and television 10performs processing based on results of the reserved word recognitionprocessing and the free word recognition processing. Television 10 isconfigured, when the reserved word is a command (hereinafter referred toas “search command”) associated with a preset application for search(hereinafter referred to as “search application”), to perform free wordsearch with the search application.

For example, when user 700 utters “Search images for ABC”, voicerecognition apparatus 100 determines that the acquired voice includesthe free word of “ABC”, the reserved word of “images” with the attributeof “search”, and the reserved word of “search” with the attribute of“command”. As a result, television 10 performs processing for searchingthe reserved word (search category) of “images” for the free word of“ABC”.

Note that, in the present exemplary embodiment, voice recognizer 50 isconfigured to receive from television 10 all the voices (voiceinformation) acquired by voice acquisitor 101, to perform dictation ofall pieces of the voice information, and to transmit all pieces ofresulting character string information to television 10. However, voiceprocessor 102 of television 10 may be configured to transmit the voiceinformation other than the reserved word recognized using reserved wordtable D400 to voice recognizer 50.

Next, the free word recognition processing will be described withreference to FIG. 5.

FIG. 5 is a diagram illustrating an outline of dictation performed byvoice recognition system 11 according to the first exemplary embodiment.

FIG. 5 illustrates a state where a web browser is displayed on display140 of television 10. For example, when user 700 performs a search witha free word (that is, free word search) using an Internet searchapplication of the web browser, an image illustrated in FIG. 5 as anexample is displayed on display 140.

Note that the Internet search application of the web browser illustratedin FIG. 5 as an example may start based on the above reserved wordrecognition processing, or may start based on a user operation usingremote controller 20 or other instrument.

Entry field 203 is an area for entry of a free word to be used for thesearch on the web browser. While a cursor is displayed in entry field203, user 700 may enter a free word in entry field 203.

When user 700 utters a voice in this state toward remote controller 20,mobile terminal 30, or microphone 130 of television 10, a voice signalgenerated from the voice is input into voice acquisitor 101, and isconverted into voice information. Then, the voice information istransmitted from television 10 via network 40 to voice recognizer 50.For example, when user 700 utters “ABC”, the voice information based onthis voice is transmitted from television 10 to voice recognizer 50.

Voice recognizer 50 compares the voice information received fromtelevision 10 with the database to convert the voice information into acharacter string. Then, as a result of the voice recognition of thereceived voice information, voice recognizer 50 transmits information onthe character string (that is, character string information) via network40 to television 10. Voice recognizer 50 compares, when the receivedvoice information is generated from the voice of “ABC”, the voiceinformation with the database to convert the voice information into thecharacter string of “ABC”, and transmits the character stringinformation to television 10.

On receipt of the character string information from voice recognizer 50,television 10 causes recognition result acquisitor 103, intentioninterpretation processor 104, reserved word processor 106, displaycontroller 108, and any other unit to operate in accordance with thecharacter string information. Television 10 then displays the characterstring corresponding to the character string information in entry field203. For example, on receipt of the character string information thatindicates the character string of “ABC” from voice recognizer 50,television 10 displays the character string of “ABC” in entry field 203.

Thus, voice recognition apparatus 100 may acquire the voice uttered bythe user, convert the voice into the character string by performingvoice recognition processing, and input the character string intotelevision 10.

Then, the web browser displayed on display 140 of television 10 performsa free word search with the character string displayed in entry field203.

Note that as an example of search processing performed by television 10,the present exemplary embodiment has cited an example of starting theweb browser and performing the Internet search based on the free word,as described above. However, the search processing to be performed bytelevision 10 is not limited to this example. For example, television 10may perform processing such as search processing for searching a programguide for a program based on the free word, and search processing forsearching a plurality of recorded programs for a recorded program basedon the free word.

Also, television 10 may include a command processor that executescommand processing based on a reserved word when the reserved word isnot a search command. Examples of such a reserved word include wordssuch as “play” for instructing to play a recorded program and “record”for instructing to record a program.

Next, free word search processing to be performed by web browsercontroller 170 of television 10 according to the present exemplaryembodiment will be described with reference to FIG. 6 to FIG. 9.

FIG. 6 is diagram illustrating an example of the web browser displayedon display 140 of television 10 according to the first exemplaryembodiment.

When television 10 starts the web browser and displays a search site onthe web browser, as illustrated in FIG. 6 for example, display 140displays URL bar 301 of the web browser, search bar 302 of the webbrowser, web page 303 of the search site the web browser displays, aplurality of buttons 304 on web page 303, search character string entryarea 305 on web page 303, a plurality of search results 306 on web page303, login character string entry area 307 on web page 303,advertisement 308 displayed on web page 303, search character stringentry area 309 in advertisement 308, and any other component.

FIG. 7 is a flow chart illustrating an operation example of the voicesearch processing to be performed by voice recognition system 11according to the first exemplary embodiment.

FIG. 8 is a flow chart illustrating an operation example of voicerecognition intention interpretation processing to be performed by voicerecognition system 11 according to the first exemplary embodiment.

FIG. 9 is a flow chart illustrating an operation example of searchcategory determination processing to be performed by voice recognitionsystem 11 according to the first exemplary embodiment.

As illustrated in FIG. 7, in voice recognition system 11, after thevoice search processing starts, voice recognition intentioninterpretation processing is performed first (step S701).

The voice recognition intention interpretation processing of step S701will be described with reference to FIG. 8.

As described above, in television 10, an operation of user 700, forexample, pressing the microphone button of remote controller 20 causesvoice recognition apparatus 100 to start voice recognition processing.

When user 700 utters a voice in this state, the voice of user 700 isconverted into a voice signal by microphone 130, the microphone ofremote controller 20, or the microphone of mobile terminal 30, and thevoice signal is input into voice acquisitor 101. In this way, voiceacquisitor 101 acquires the voice signal of user 700 (step S801).

Voice acquisitor 101 converts the acquired voice signal of user 700 intovoice information that may be used for various types of downstreamprocessing, and outputs the voice information to voice processor 102.When user 700 utters, for example, “Search images for ABC”, voiceacquisitor 101 outputs the voice information based on the voice to voiceprocessor 102.

Voice processor 102 compares the voice information that is output fromvoice acquisitor 101 with reserved word table D400 previously stored instorage 180. Then, voice processor 102 examines whether the voiceinformation that is output from voice acquisitor 101 includesinformation corresponding to the reserved words registered in reservedword table D400 (step S802).

For example, when the voice information that is output from voiceacquisitor 101 includes voice information based on a word of “search”uttered by user 700, and when “search” has been registered in reservedword table D400 as the reserved word with the attribute of “command”,voice processor 102 determines that the word of “search”, which is thereserved word with the attribute of “command”, is included in the voiceinformation.

Then, voice processor 102 outputs the reserved word information thatrepresents the reserved word included in the voice information and theattribute of the reserved word to recognition result acquisitor 103.

Note that, as illustrated in FIG. 3, words to be used for operationssuch as the operation of television 10 and the operation of the webbrowser are registered in reserved word table D400 as the reservedwords.

In addition, reserved word table D400 is configured to allow updatethrough addition, deletion, or any other operation of the reserved word.For example, user 700 may add a new reserved word to reserved word tableD400. Alternatively, a new reserved word may be added to reserved wordtable D400 via network 40. This allows voice recognition apparatus 100to perform voice recognition processing in accordance with updatedreserved word table D400.

In addition, in step S802, voice processor 102 transmits the voiceinformation that is output from voice acquisitor 101, fromtransmitter-receiver 150 via network 40 to voice recognizer 50.

Voice recognizer 50 converts the received voice information intocharacter strings with a free word being isolated from a word other thanthe free word (for example, prepositions, particles, and the like).

Voice recognizer 50 compares the database that associates the free wordwith the character string, with the received voice information. When thereceived voice information includes the free word registered in thedatabase, voice recognizer 50 selects the character string (includingone character) corresponding to the free word. Note that the reservedword may be included in the free word in voice recognizer 50. In thisway, voice recognizer 50 converts the received voice information intothe character strings. For example, when voice recognizer 50 receivesvoice information based on the voice “Search images for ABC” uttered byuser 700, voice recognizer 50 converts the voice information into thecharacter strings of “search”, “images”, “for”, and “ABC”. Voicerecognizer 50 transmits character string information representing eachof the converted character strings (text information) via network 40 totelevision 10 (step S802).

This database, which is included in voice recognizer 50, may be atanother place on network 40. Also, this database may be configured sothat registered information may be updated regularly or irregularly.

Recognition result acquisitor 103 of television 10 acquires the reservedword information that is output from voice processor 102 as a result ofthe voice recognition (this reserved word information includes thereserved word and the attribute), and the character string informationtransmitted from voice recognizer 50 as a result of the voicerecognition (this character string information includes the free wordand the reserved word). Recognition result acquisitor 103 then outputsthe reserved word information and the character string information tointention interpretation processor 104.

In order to specify intention of the voice operation uttered by user700, intention interpretation processor 104 sorts the character stringinformation based on the reserved word information acquired fromrecognition result acquisitor 103 (step S803).

Intention interpretation processor 104 removes, for example, thereserved word from the character string information by collating thecharacter string information with the reserved word information. Here,intention interpretation processor 104 may compare reserved word tableD400 illustrated as an example in FIG. 3 with the character stringinformation. When the reserved word registered in reserved word tableD400 is included in the character string information, intentioninterpretation processor 104 may remove the word as the reserved wordfrom the character string information.

Then, intention interpretation processor 104 sorts the free word byremoving the character strings that do not correspond to the free word,such as prepositions and particles, from the remaining character stringinformation. Therefore, the reserved word is not included in the freeword in intention interpretation processor 104.

For example, when acquiring the character string information such as“search”, “images”, “for”, and “ABC”, and the reserved word informationrepresenting “images” and “search”, intention interpretation processor104 sorts “ABC” as the free word, “images” as the reserved word with theattribute of “search”, and “search” as the reserved word with theattribute of “command”. Since intention interpretation processor 104performs such sorting, television 10 may perform an operation inaccordance with the intention of user 700 (intention of the voiceoperation uttered by user 700). In the above example, regarding thereserved word of “images” with the attribute of “search”, television 10may execute the reserved word of “search” with the attribute of“command” using the free word of “ABC”.

Note that the reserved word with the attribute of “search” is, forexample, a term to be used for specification of the search category, asillustrated as an example in FIG. 3, such as “images”, “videos”,“program”, “Web”, “news”, “maps”, “restaurant”, and “gas station”. Also,the reserved word with the attribute of “command” is, for example, aterm to be used for operations of television 10 such as “proceed”,“return”, and “search”, as illustrated as an example in FIG. 3. However,the reserved word is not limited to these terms.

Note that intention interpretation processor 104 may perform intentioninterpretation using the character string included in the characterstring information, such as a particle and a preposition.

Thus, intention interpretation processor 104 performs voice recognitionintention interpretation processing of step S701.

Next, based on the processing of step S701, reserved word processor 106determines whether information that is output from intentioninterpretation processor 104 includes the reserved word with theattribute of “search” (step S702).

When the determination is made in step S702 that the information that isoutput from intention interpretation processor 104 includes the reservedword with the attribute of “search” (Yes), the flow chart goes to stepS704. Then, search URL creator 171 of web browser controller 170 createsthe search URL based on the reserved word and free word included in theinformation that is output from intention interpretation processor 104(step S704).

In step S704, search URL creator 171 refers to search URL list D500stored in storage 180 based on the reserved word, and creates the searchURL.

For example, when the words of “ABC” and “images” are included in theinformation that is output from intention interpretation processor 104,since “images” has been registered in reserved word table D400 as thereserved word with the attribute of “search”, determination of “Yes” ismade in step S702.

Then, search URL creator 171 creates the search URL that uses the freeword of “ABC” and the reserved word of “images”. Based on the reservedword of “images”, search URL creator 171 reads the search URL templatecorresponding to the reserved word of “images” from search URL list D500(FIG. 4). Then, search URL creator 171 adds the free word to the readsearch URL template. Accordingly, for example, based on the example ofsearch URL list D500 illustrated in FIG. 4, the search URL of thefollowing character string is created.

http://search.co.jp/images?q=ABC

URL bar display controller 173 inputs the search URL created in stepS704 into URL bar 301 (FIG. 6) of the web browser. Accordingly, a searchbased on the search URL is performed in search site 70. Information onthe web page resulting from the search is read into the web browser byweb page display controller 174. Then, the information on the web pageis displayed by display controller 108 on the web browser displayed ondisplay 140 (step S705).

In step S705, for example, an image as illustrated in FIG. 6 isdisplayed on display 140. FIG. 6 illustrates an example of the image inwhich a result of search of the reserved word of “images” for the freeword of “ABC” is displayed on web page 303. Note that web page 303illustrated in FIG. 6 is an example of the web page which is acquiredfrom search site 70 and which is displayed on the web browser.Arrangement of each element, a method for displaying the search result,presence of advertisement, a number of character string entry areas, andany other design in web page 303 are not limited to the exampleillustrated in FIG. 6. Design of the web page may differ from searchsite to search site.

When the determination is made in step S702 that the information that isoutput from intention interpretation processor 104 does not include thereserved word with the attribute of “search” (No), the flow chart goesto step S703. Then, search URL matching processor 172 of web browsercontroller 170 performs processing for determining the search category.Hereinafter, the processing to be performed in step S703 may be referredto as “search category determination processing”.

The search category indicates the range to be searched, and correspondsto the reserved word in search URL list D500 of FIG. 4 in the presentexemplary embodiment.

Here, in what situation the determination of “No” is made in step S702will be described with reference to a specific example.

It is assumed here that user 700 first utters “Search images for ABC”,and that a search for the reserved word of “images” has already beenperformed for the free word of “ABC”. It is assumed that, as a result ofthe search, for example, web page 303 illustrated in FIG. 6 has beendisplayed on display 140.

Note that this search is not limited to a search through the voiceoperation. For example, this search may be performed based on the useroperation (input operation) using the input unit (for example, touchpad)of remote controller 20 and the input unit (for example, touch panel) ofmobile terminal 30.

Following this state, when user 700 presses, for example, button 304 of“Map” on web page 303, the search category is changed from “images” to“maps” on the web browser. This operation may be performed by either thevoice operation or the user operation using the input unit of remotecontroller 20 or mobile terminal 30 (that is, input operation by theuser).

Accordingly, the web page displayed on the web browser moves from webpage 303 illustrated in FIG. 6 to the web page corresponding to thesearch category of “maps” (for example, a web page of maps). That is, inaccordance with information transmitted from the website that posts mapinformation, the web page of maps is displayed on the web browser.

Subsequently, when user 700 utters only a free word of “XYZ”, as aresult of voice recognition intention interpretation processing of stepS701, only information that indicates a character string of the freeword of “XYZ” is output from intention interpretation processor 104.

Reserved word processor 106 determines that the reserved word with theattribute of “search” is not included in the information that is outputfrom intention interpretation processor 104. Therefore, determination of“No” is made in step S702. Then, the flow chart goes to step S703.

In step S703, the search category determination processing is performed.Specifically, search URL matching processor 172 performs matchingprocessing (matching processing of character strings) between search URLlist D500 and the URL managed by URL bar display controller 173 (URLrelated to the web page displayed on display 140), and determines thesearch category based on a result of the matching processing (stepS703).

Details of the search category determination processing of step S703will be described with reference to FIG. 9.

In order to determine the search category, search URL matching processor172 performs matching processing between the search URL templatesregistered in search URL list D500 and the URL managed by URL bardisplay controller 173, in predetermined order. The followingdescription assumes that this order is order in which the search URLtemplates are registered in search URL list D500 (in the order from topof search URL list D500 illustrated in FIG. 4). However, the order ofthe matching processing is not limited to this order, and may be anotherorder.

Search URL matching processor 172 substitutes an initial value “1” forvariable n (step S901).

Next, search URL matching processor 172 acquires the character string ofthe URL managed by URL bar display controller 173, the character stringbeing displayed in URL bar 301 (for example, URL bar 301 illustrated inFIG. 6) of the web browser (step S902). Hereinafter this characterstring is referred to as character string A.

Next, search URL matching processor 172 acquires, from search URL listD500, the character string of the nth registered search URL template insearch URL list D500 (step S903).

In the example illustrated in FIG. 4, the search category with n=1 is“web” in a first line of search URL list D500, and the correspondingsearch URL template is as follows.

http://search.co.jp/web?q=

The search category with n=2 is “images” in a second line of search URLlist D500, and the corresponding search URL template is as follows.

http://search.co.jp/images?q=

The search category with n=9 is “gas station” in a ninth line of searchURL list D500, and the corresponding search URL template is as follows.

http://web-navi.co.jp/search?category=gas-station&query=

Thus, search URL matching processor 172 reads the search URL templatesfrom search URL list D500 in accordance with variable n. Hereinafter,the character string of the search URL template corresponding to thevariable n is referred to as character string B.

Next, search URL matching processor 172 determines whether any characterstring that agrees with character string B is included in characterstring A (step S904).

Here, this determination to be performed by search URL matchingprocessor 172 in step S904 is referred to as matching processing. Whenit is determined in step S904 that a character string that agrees withcharacter string B is included in character string A (that is, when thematching processing has succeeded) (Yes), the flow chart goes to stepS905. Then, search URL matching processor 172 sets the search categorythat is nth registered in search URL list D500 as a return value, basedon search URL list D500 (step S905).

For example, when character string B is

http://search.co.jp/images?q=

and character string A is

http://search.co.jp/images?q=ABC,

the character string that agrees with character string B is included incharacter string A, and thus the determination of “Yes” is made in stepS904.

In the example illustrated in FIG. 4, if the determination of “Yes” ismade in step S904 when n=1, “web” firstly registered in search URL listD500 is set as the return value in step S905. If the determination of“Yes” is made when n=2, “images” secondly registered in search URL listD500 is set as the return value.

When it is determined in step S904 that the character string that agreeswith character string B is not included in character string A (that is,when the matching processing has failed) (No), search URL matchingprocessor 172 adds 1 to variable n (step S906).

Next, search URL matching processor 172 collates variable n with anumber of search categories registered in search URL list D500 (a numberof search URL templates) (step S907).

When it is determined in step S907 that variable n is larger than thenumber of search categories registered in search URL list D500 (Yes),search URL matching processor 172 sets a preset search category(hereinafter referred to as “default search category”) as the returnvalue (step S908).

Step S908 is processing to be performed when the character string thatagrees with character string B is included in none of the search URLtemplates registered in search URL list D500. Note that although thedefault search category is, for example, “web”, the default searchcategory may be another search category.

When it is determined in step S907 that variable n is equal to or lessthan the number of search categories registered in search URL list D500(No), search URL matching processor 172 returns to step S903 andcontinues the matching processing. This processing is performed when thesearch URL template on which the matching processing has not beenperformed exists in search URL list D500.

Note that although variable n is used in the flow chart illustrated inFIG. 9 for repetitive processing, repetitive processing using aniterator may be performed.

Thus, when it is determined in step S702 that the reserved word with theattribute of “search” is not included in the information that is outputfrom intention interpretation processor 104 (No), search URL matchingprocessor 172 determines the search category by performing searchcategory determination processing of step S703. Then, search URLmatching processor 172 sets the determined search category as the returnvalue, and outputs the return value to search URL creator 171 of webbrowser controller 170.

After completion of step S703, the flow chart goes to step S704. Then,search URL creator 171 reads, from search URL list D500, the search URLtemplate corresponding to the return value that is output from searchURL matching processor 172 (that is, the search category determined instep S703), and creates the search URL based on the read search URLtemplate and the free word included in the information that is outputfrom intention interpretation processor 104 (step S704). Processing ofstep S705 following step S704 is as described above.

Note that search URL matching processor 172 may set variable n as thereturn value. In this case, search URL creator 171 only needs to readthe search URL template from search URL list D500 based on variable n.

The above search category determination processing will be described byciting a specific example. Here, a continuation of the above specificexample of the determination of “No” in step S702 will be described asthe specific example of the search category determination processing.

In this example, the information that is output from intentioninterpretation processor 104 does not include any reserved word with theattribute of “search”, but includes only information that indicates thecharacter string of the free word of “XYZ”.

Also in this example, on the web browser is displayed a web page of amap obtained by a previous search performed by user 700. Therefore,search URL matching processor 172 sets the character string of the URLfor displaying the web page of a map in character string A in step S902.This character string is, for example, as follows.

http://search.co.jp/maps?q=

If the search category of “maps” is searched for the free word of “ABC”,the character string to be set in character string A is as follows, forexample.

http://search.co.jp/maps?q=ABC

Accordingly, when n=5 in step S904, that is, when the following searchURL template corresponding to the search category of “maps” is set incharacter string B, search URL matching processor 172 succeeds in thematching processing.

http://search.co.jp/maps?q=

That is, it is determined that the character string that agrees withcharacter string B is included in character string A. Therefore, in stepS905, search URL matching processor 172 sets the search category of“maps” as the return value, and outputs the return value to search URLcreator 171.

Search URL creator 171 reads, from search URL list D500, the followingsearch URL template corresponding to the return value (here, the searchcategory of “maps”) that is output from search URL matching processor172 in step S704.

http://search.co.jp/maps?q=

Then, based on the read search URL template and the free word of “XYZ”included in the information that is output from intention interpretationprocessor 104, search URL creator 171 creates the following search URL.

http://search.co.jp/maps?q=XYZ

In step S705, URL bar display controller 173 inputs into URL bar 301 ofthe web browser the above search URL created by search URL creator 171.Information on the web page obtained from search site 70 resulting fromthis search (that is, a result of the search of the search category of“maps” for the free word of “XYZ”) is read into the web browser by webpage display controller 174. Then, the information on the web page isdisplayed by display controller 108 on the web browser displayed ondisplay 140.

[1-3. Effect, Etc.]

As described above, in the present exemplary embodiment, the voicesearch apparatus includes the web page display controller, the voicerecognizer, the matching unit, and the URL creator. The web page displaycontroller controls the web page which is to be displayed on thedisplay. The voice recognizer recognizes the voice that is input,converts the voice into character string information, and outputs thecharacter string information. The matching unit refers to the URL listin which the character string representing part of the URL is registeredfor each search category, performs matching processing between the URLof the web page displayed on the display and the character string in theURL list, and specifies the character string corresponding to the URL inthe URL list. The URL creator creates the URL based on the characterstring specified by the matching unit and the character stringinformation.

Note that voice search apparatus 200 is an example of the voice searchapparatus, web page display controller 174 is an example of the web pagedisplay controller, voice recognizer 50 is an example of the voicerecognizer, search URL matching processor 172 is an example of thematching unit, search URL list D500 is an example of the URL list, andsearch URL creator 171 is an example of the URL creator.

When performing search processing continuously, user 700 who uses voicesearch apparatus 200 configured in this way may perform a new searchbased on contents of a previous search and new contents to be uttered,without uttering again contents that are input in the previous search.

For example, when searching, for a new free word, a search categoryidentical to the search category with which the previous search has beenmade, user 700 may omit utterance of the search category that overlapsthe previous search. That is, user 700 only needs to utter the new freeword.

As a specific example, after searching the search category of “maps” forthe free word of “ABC” by uttering or input-operating “Search maps forABC” immediately before, when performing a new search of the searchcategory of “maps” for the free word of “XYZ”, user 700 does not need toutter “Search maps for XYZ”. User 700 only needs to utter “XYZ”.Accordingly, in voice search apparatus 200, the search URL is created sothat the search for the free word of “XYZ” may be performed on thesearch category of “maps” identical to the search category of which theprevious search has been made. That is, only by uttering “XYZ”, user 700may perform search processing identical to the processing performed whenuttering “Search maps for XYZ”.

Thus, in voice search apparatus 200 according to the present exemplaryembodiment, user 700 does not need to repeatedly utter a word thatoverlaps a word uttered at a time of the previous voice operation. Thisreduces complicatedness of performing voice operation of a device thatincludes voice search apparatus 200, and improves operativity.

Other Exemplary Embodiments

As described above, the first exemplary embodiment has been described asan example of a technique disclosed in the present application. However,the technique in the present disclosure is not limited to this example,and may be applied to exemplary embodiments to which change,replacement, addition, and omission have been made. In addition, it isalso possible to make a new exemplary embodiment by combining elementsdescribed in the above first exemplary embodiment.

Therefore, other exemplary embodiments will be described below.

Search URL matching processor 172 may perform matching processing with aremaining section except for a previously prescribed character string(for example, a URL scheme), such as “http” and “https”.

Also, when performing matching processing, search URL matching processor172 may read one search URL template from search URL list D500, extracta domain name and query string, and determine whether each extractedcharacter string is included in each of a domain name and query stringof the URL managed by URL bar display controller 173.

A specific example will be cited to describe this operation example. Forexample, the following character string is registered in search URL listD500 illustrated in FIG. 4, as the search URL template corresponding tothe search category of “gas station”.

http://web-navi.co.jp/search?category=gas-station&query=

From this search URL template, a domain name of “web-navi.co.jp” and aquery string of “category=gas-station” may be extracted, through the useof “:”, which is a delimiter for a URL scheme and “?”, which is adelimiter for a query string. Therefore, search URL matching processor172 makes a determination that the matching processing has succeeded,when this character string of the domain name is included in thecharacter string of the domain name of the URL managed by URL bardisplay controller 173, and when this character string of the querystring is included in the character string of the query string of theURL managed by URL bar display controller 173. Note that it is alsopossible to configure search URL matching processor 172 to performmatching processing using the domain name and path, and matchingprocessing using the path and query string, in addition to theabove-described matching processing using the domain name and querystring.

Note that search URL list D500 is not limited to data structureillustrated in FIG. 4. Search URL list D500 may be data structure inwhich the domain name, path, and query string are previously separated,and an element to be used for search category determination processingand other elements are separately registered. One specific example ofthis data structure will be described. For example, in the example ofthe search URL template corresponding to the search category of “gasstation” in search URL list D500 illustrated in FIG. 4, the domain nameof “web-navi.co.jp”, the path of “/search”, and the query string of“category=gas-station” are registered in search URL list D500 as theelements to be used for search category determination processing. Inaddition, the URL scheme of “http://”, the path of “/search”, and otheritem “q=” of the query string are registered in search URL list D500 asthe elements which are not used for search category determinationprocessing. Search URL list D500 may have such data structure.

Such data structure of search URL list D500 makes it possible to reducea load of matching processing when search URL matching processor 172performs search category determination processing.

When performing search category determination processing, search URLmatching processor 172 may perform matching processing as follows.Search URL matching processor 172 does not perform matching processingon a search URL template corresponding to a default search category, andperforms matching processing on search categories other than the defaultsearch category. Then, when all of the matching processing fails, searchURL matching processor 172 sets the search category as the defaultsearch category.

Also, in the present exemplary embodiment, the operation example hasbeen described in which voice search apparatus 200 makes a search on theInternet. However, the search to be performed by voice search apparatus200 is not limited to the Internet search. Voice search apparatus 200may perform, for example, a program search by a program guideapplication. In case of the program search by the program guideapplication, for example, as illustrated in the search URL templatecorresponding to the search category of “program” in search URL listD500 of FIG. 4, a “file” scheme or a scheme of an application name suchas “epg_app” may be used. Alternatively, voice search apparatus 200 maysearch recorded programs, etc.

Note that, in television 10, voice recognition processing may be startedby the aforementioned method. Therefore, after voice recognitionprocessing is started, user 700 may perform the above search even whilewatching a program on television 10.

Television 10 may include a command processor that performs commandprocessing other than search. Examples of this command processinginclude channel change and volume change of television 10. In addition,a command that may be executed by this command processor may beregistered in reserved word table D400 as the reserved word with theattribute of “command”, for example, “channel up”, “volume up”,“playback”, “stop”, “display character”, and any other command.

Voice search apparatus 200 described in the present exemplary embodimentmay be included in a device that does not include display 140.

While the example in which voice recognizer 50 is disposed on network 40has been described in the present exemplary embodiment, voice recognizer50 may be included in voice search apparatus 200.

In the present exemplary embodiment, reserved word table D400 and searchURL list D500 may be stored in either storage 180 or storage 181.Alternatively, at least one of reserved word table D400 and search URLlist D500 may be acquired via network 40 as necessary.

Note that each block illustrated in FIG. 2 may be configured as anindependent circuit block, and may be configured such that a processormay execute software that is programmed to implement the operation ofeach block.

INDUSTRIAL APPLICABILITY

The present disclosure is applicable to devices that perform processingoperations instructed by users. Specifically, the present disclosure isapplicable to devices such as a car navigation system, a driving supportsystem, a mobile terminal device, a television receiver, a personalcomputer (PC), a PC peripheral such as a printer, a set top box, a homeaudio visual system such as a video recorder and an optical disk player,a remote controller, a game machine, a smart phone, a tablet terminal,and a home electric appliance such as a microwave oven.

REFERENCE MARKS IN THE DRAWINGS

-   -   10 television receiver    -   11 voice recognition system    -   20 remote controller    -   30 mobile terminal    -   40 network    -   50 voice recognizer    -   60 URL server    -   70 search site    -   100 voice recognition apparatus    -   101 voice acquisitor    -   102 voice processor    -   103 recognition result acquisitor    -   104 intention interpretation processor    -   106 reserved word processor    -   108 display controller    -   110 operation acceptor    -   130 microphone    -   140 display    -   150 transmitter-receiver    -   170 web browser controller    -   171 search URL creator    -   172 search URL matching processor    -   173 URL bar display controller    -   174 web page display controller    -   180,181 storage    -   190 tuner    -   200 voice search apparatus    -   201 voice recognition icon    -   202 indicator    -   203 entry field    -   301 URL bar    -   302 search bar    -   303 web page    -   304 button    -   305 search character string entry area    -   306 search result    -   307 login character string entry area    -   308 advertisement    -   309 search character string entry area    -   700 user    -   D400 reserved word table    -   D500 search URL list

The invention claimed is:
 1. A voice search apparatus comprising: a webpage display controller that controls a web page which is to bedisplayed on a display; a voice processor that receives voiceinformation from a voice that is input, determines if a preset reservedword is included in the voice information, and outputs reserved wordinformation including an attribute of the reserved word; a voicerecognizer that converts the voice information into character stringinformation, and outputs the character string information; a storagethat stores a URL list in which a character string representing part ofa URL is registered for a search category; a processor including amatching unit that refers to the URL list, performs matching processingbetween a first URL of the web page displayed on the display and thecharacter string in the URL list, and specifies the character string inthe URL list corresponding to the first URL; and a URL creator thatcreates a second URL based on the character string specified by thematching unit and the character string information, wherein theprocessor determines whether or not the reserved word information isincluded in the character string information, wherein the URL creatorcreates the second URL based on the character string specified by thematching unit and the character string information when the processordetermines that the reserved word information is not included in thecharacter string information, wherein the URL creator creates a thirdURL based on the URL list registered for the search category relevant tothe reserved word information and the character string informationexcept the reserved word information when the processor determines thatthe reserved word information is included in the character stringinformation, and wherein the web page display controller displays theweb page, relevant to the second URL or the third URL created by the URLcreator, on the display.
 2. A voice search method comprising: displayinga web page on a display; recognizing a voice that is input andconverting the voice into character string information; and when aninformation relevant to a search category is not included in thecharacter string information, referring to a URL list in which acharacter string representing part of a URL is registered for the searchcategory, performing matching processing between a first URL of the webpage displayed on the display and the character string in the URL list,and specifying the character string in the URL list corresponding to thefirst URL via a processor, and creating a second URL based on thespecified character string and the character string information, whenthe information relevant to the search category is included in thecharacter string information, creating a third URL based on the URL listregistered for the search category and the character string informationexcept the information relevant to the search category, and displayingthe web page, relevant to the second URL or the third URL, on thedisplay.
 3. A display apparatus comprising: a display; a web pagedisplay controller that controls a web page which is to be displayed onthe display; a voice processor that receives voice information from avoice that is input, determines if a preset reserved word is included inthe voice information, and outputs reserved word information includingan attribute of the reserved word; a voice recognizer that converts thevoice information into character string information, and outputs thecharacter string information; a storage that stores a URL list in whicha character string representing part of a URL is registered for a searchcategory; a processor including a matching unit that refers to the URLlist, performs matching processing between a first URL of the web pagedisplayed on the display and the character string in the URL list, andspecifies the character string in the URL list corresponding to thefirst URL; and a URL creator that creates a second URL based on thecharacter string specified by the matching unit and the character stringinformation, wherein the processor determines whether or not thereserved word information is included in the character stringinformation, wherein the URL creator creates the second URL based on thecharacter string specified by the matching unit and the character stringinformation when the processor determines that the reserved wordinformation is not included in the character string information, whereinthe URL creator creates a third URL based on the URL list registered forthe search category relevant to the reserved word information and thecharacter string information except the reserved word information whenthe processor determines that the reserved word information is includedin the character string information, and wherein the web page displaycontroller displays the web page, relevant to the second URL or thethird URL created by the URL creator, on the display.